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Abstract. We argue that the freezing transition scenario, previously conjectured 
to occur in the statistical mechanics of 1//— noise random energy models, governs, 
after reinterpretation, the value distribution of the maximum of the modulus of the 
characteristic polynomials pn(&) of large N x N random unitary (CUE) matrices Un; 
i.e. the extreme value statistics of pn{9) when N — > oo. In addition, we argue that it 
leads to multifractal-like behaviour in the total length ^n(x) of the intervals in which 
\pn(8)\ > N x ,x > 0, in the same limit. We speculate that our results extend to the 
large values taken by the Riemann zeta-function £(s) over stretches of the critical line 
s = 1/2 + it of given constant length, and present the results of numerical computations 
of the large values of £(1/2 + it). Our main purpose is to draw attention to possible 
connections between extreme value problems in the statistical mechanics of 1//— noise 
random energy models, random matrix theory, and the theory of the Riemann zeta 
function, and to the potential consequences of freezing in the latter two cases. 



1. Introduction 

Over the past 40 years, considerable evidence has accumulated for connections between certain 
statistical properties of the Riemann zeta-function, C( s )> an d those of large random matrices. 
For example, correlations between the nontrivial zeros of the zeta function on the critical 
line s = 1/2 + it, t € R, are believed to coincide, in the limit t — > oo, with those between 
the eigenvalues of large random unitary or hermitian matrices, and the value distribution of 
C(l/2 + it) is believed to be related to that of the characteristic polynomials of large random 
unitary or hermitian matrices. Our purpose here is to connect these two areas of research to 
a third, the statistical mechanics of disordered energy landscapes. The analogy we develop 
suggests that the freezing transition observed in the statistical mechanical problem also governs 
the extreme values taken by the characteristic polynomials of random matrices and the zeta 
function. This sheds new light on the longstanding problem of determining the maximum size 
of the zeta function. 

The Riemann zeta-function 

<w-f:i-n(>-^)" w 

n=l P 

is of central importance in mathematics because it encodes the distribution of the primes p 
in the positions of its non-trivial zeros. The Riemann Hypothesis places these zeros on the 
critical line. Some of the most important questions in the theory of the zeta function concern 
the distribution of values it takes on the critical line. It was proved in Selberg 1944 & 1946, 
for example, that, when t € R, log |£(l/2 + it)\ satisfies a central limit theorem: 



lim -^-meas < 

T — >oo T 



T <t <1T : a < 1 °gl£M±J < A = » /" expf-^)^ (2) 
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which implies that its typical size is of the order of \T°g log t when t — > oo (see, e.g., Titchmarsh 
1986 and Laurincicas 1996). As regards the exceptionally large values taken by the zeta function 
over long ranges, the Lindelof hypothesis asserts that |C(l/2 + it)\ = o(t e ) for any e > 0, the 
Riemann Hypothesis implies that 

| C (l/2 + if )| = o(exp(igg)), (3) 



Freezing Transitions and Extreme Values 



3 



where c\ is a constant, and, unconditionally, we know that 



ic(i/2 +i( )i=«(; X py^5yj, < 4 > 

meaning that {((1/2 + it)\ takes the value in the argument on the right-hand side infinitely 
often (see, e.g. , Titchmarsh 1986). The exceptionally large values of |£(l/2 + it)| thus lie in the 
range between ([3]) and Q . The problem of determining where precisely within this range they 
lie has attracted considerable attention in recent years, but remains unresolved. The extreme 
values in question are so rare that extensive numerical computations have thus far failed to 
settle the matter. 

It was observed by Montgomery (in relation to a conjecture, to be described below, of 
Farmer, Gonek & Hughes, see Farmer, Gonek & Hughes 2007) that treating the local maxima 
of log |C(l/2 + it)\ as being statistically independent, assuming that their values satisfy the 
central limit theorem ([2]), and using the fact the the number of local maxima of |C(l/2 + z£)| in 
< t < T is of the order of log ^ (the number of zeros in the range) , implies that the typical 



size of the maximum value of |C(l/2 + is of the order of exp yc2\/log(t) log log(t)J , where 
C2 is a constant]!] Note that this is considerably larger than the typical size of |C(l/2 + it)\, 
and that it is closer to Q than to ([3]), implying that the extreme values are not much larger 
than the largest value known to be reached infinitely often. Significantly, this calculation 
makes clear that the square root in the estimate of the maximum size is related to the fact 
that the exponential in the integrand on the right hand side of ([2]) is quadratic in x. Had the 
exponential been linear in x then the result would have been closer to the upper limit g). 
It is worth remarking that the Montgomery model would predict a Gumbel distribution (see 
Section 3) for the fluctuations of the extreme values of |C(l/2 + it)\ around this typical size. 

It is important to note that Montgomery's observation rests on two key assumptions. The 
first is that the central limit theorem ^ extends out to the range of the large values predicted. 
This is far beyond the range for which it has been established. The second is that the values 
of the local maxima of |C(l/2 + it)\ are uncorrelated. In fact, the values of log |C(l/2 + it)\ are 
known to be correlated in a way that is significant from the point of view of the ideas we shall 
explore here. Specifically, let us define, for a fixed i El, 

r(0 , > ... (\ ■ A. _ . J\ 



V^V) = -21og|C( £ + i(t + a:)) I = -2RelogC f ^ + i(t + x)\ . (5) 

ghe factor of —2 is introduced for reasons to be explained below.) The central limit theorem 
implies that, when t — > oo, V^\x) behaves like a Gaussian random function of x. To 
characterize such a random process it is natural to consider the two-point correlation function 
(v^\xi)V^\x2)^ , with brackets (...) denoting the average over an interval [t — h/2, t + h/2] 
such that j^-j <C h <t. A simple argument, sketched in Appendix A, shows that when t — > oo 

vi<\ xi )v«\x 2 )) « ( - 2lo f i ~ x f ;° r | xi -f 21 « 1 (6 ) 

* V ' * V V \ 21oglogt, for ki-xal^j^ v ' 

This illustrates the fact that values of log |£(l/2 + it)| are indeed correlated (see also Bourgade 
2010). The significance of the precise form of the correlations will become apparent when 

\ As will be discussed in Section 3, the typical size of the maximum of M i.i.d. normal, mean-zero 
random variables Vi with variance Ejl^ 2 } = u < oo behaves asymptotically like clm ~ y/2a log M. The 
estimate for the maximum modulus of the ^— function follows from considering M — ^ log samples 
drawn independently from the gaussian distribution in with variance a = h log log (t/2w). 
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we draw comparisons with corresponding problems in random matrix theory and statistical 
mechanics. 

Over the past decade it has become a well-established paradigm that many statistical 
properties of the Riemann zeta function along the critical line can be understood by comparing 
them to analogous properties of the characteristic polynomials of random matrices (Keating & 
Snaith 2000, Hughes, Keating & O'Connell 2001, Conrey et al. 2005, Gonek, Hughes & Keating 
2007, Conrey et al. 2008 ). Let C/jv be an N x N unitary matrix, chosen uniformly at random 
from the unitary group U{N) (i.e. Un lies in the Circular Unitary Ensemble, or CUE, of 
random matrices), and denote its eigenvalues by exp (i<j>\), ■ ■ ■ , exp (i^jv). Let 

N 

p N {9) = det (l - U N e~ w ) = H (l - e *(<^)) (7) 

n=l 

be the corresponding characteristic polynomial. To this end it is instructive to compare V t i0 (x) 
from (5) with V N U) (0) = -21og|pjv(0)|. (A gain, the factor of —2 is introduced for reasons to 
be explained below.) It was proved in Keating & Snaith 2000 that V N U \0) satisfies a central 
limit theorem that is the analogue of Q. Specifically, the values of log|pjv(#)| normalized 
by yj | log N, have a limiting distribution as N — > oo given by the right-hand side of (2). 
Identifying the mean density of the eigenvalues, N/2ir with the mean density of the Riemann 
zeros near to height t, log renders the agreement complete. 

Importantly for us here, has the following representation (Hughes, Keating 8z 

O'Connell 2001): 

00 1 1 
4 C/) W = E^K in H jV) + comp. conj.], v W = —Tt(U n ) (8) 



n=l v v 

According to Diaconis & Shahshahani 1994, the coefficients Vn for any fixed finite set 
of integers n tend, in the limit N — > oo, to i.i.d. complex gaussian variables with zero 
mean and variance E{|w n | 2 } = 1. A simple calculation (cf. ( 44|45 ) below) then shows that 
^ {Vn ] (Qi)Vn ] (h)} tends in the limit N -> oo to -21og2| sin §(0 a - 9 2 )\, and so exhibits 
precisely the same logarithmic behaviour at small distances as we found for the zeta-function. 
For large but finite N, the logarithmic divergence can be shown to saturate at \9i — #2! ~ 
so after associating N ~ log the correspondence between (6) and (x) becomes 
complete. This is significant from the point of view we seek to develop. 

Our goal here is to determine the maximum value of \pn(8)\ over some specified interval 
< 9 < L, or, more precisely, the distribution of these maximum values when Un ranges over 
U{N). Our second goal is then to use the random- matrix results to motivate predictions for 
the extreme values of the Riemann zeta function. The first steps in this direction were taken 
in Farmer, Gonek & Hughes 2007, who determined the tail of the distribution of the maximum 
values of |pjv(0)| when L = 2ir. They modeled ^(1/2 + it) in the range < t < T by the 
characteristic polynomials of a set of approximately T independently chosen random matrices 
(recall that the number of zeros in the range is of the order of TlogT, and the identification 
N ~ logT). This corresponds to asking for the typical size of the maximum value that 
\pn(9)\ takes when < 9 < 2ir and Un is sampled independently a large (exponentially 
in N) number of times from within U(N). The resulting conjecture is in accord with the 
Montgomery heuristic (and the random-matrix approach is sufficiently refined to predict a 
value for the constant C2). This is not altogether surprising, because, as discussed above, the 
central limit theorem for log \pn(9)\ corresponds to that for log |£(l/2-Mi)|, and, moreover, in 
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the random matrix case we know that the gaussian distribution extends to the large deviation 
regime (Hughes, Keating & O'Connell 2001). 

The focus here will differ from that of Farmer, Gonek & Hughes 2007 in the following 
ways. We shall be concerned with the maximum values of the characteristic polynomials of 
single matrices, rather than with large numbers of matrices, and will obtain the full value 
distribution of the maxima in the limit as N — > oo, rather than concentrating on the tail that 
is relevant when maximizing over many matrices. This leads to a model for the distribution 
of maximum values of |C(l/2 + it)\ over T < t < T + L, L < 2tt, rather than < t < T 
as T — > oo. Furthermore, it makes the problem of numerical computation of the distribution 
in question significantly easier, because one is finding the maximum only of ~ LlogT rather 
than ~ T log T numbers (the values of the local maxima) . 

Our main purpose here is to link the problems of finding the extreme value statistics of 
the characteristic polynomials of random matrices and of |C(l/2 + it)\ to an interesting and 
important class of problems in statistical mechanics. The maximum value of pn{6) over the 
interval in question can be characterized in terms of the moments 

AT /•£ AT rL 

Z N ((3;L) = -j o \p N (0)\WdB = — J e-WWdO, (3 > (9) 

where V N {9) = -2log\p N {9)\. Specifically, if F{p) = -p' 1 log Z N (f3), then 

lim T(B)= min V N (9) = min [-2 log \pn(0)\] = -2 max log\p N (6)\.(W) 

/3->oo 0e(o,L) 0e(o,L) 0e(o,L) 

The key point is that Q takes the form of a partition function for a system with energy 
Vn{9) and inverse temperature /?, and J-(/3) may then be associated with the corresponding 
free energy. Recalling that the values of Vn{9) are gaussian distributed and logarithmically 
correlated, it is natural to draw comparisons with a class of problems that has attracted a good 
deal of attention recently in the area of disordered systems, namely the statistical mechanics 
of a single particle equilibrated in a random potential energy described by a gaussian random 
processes with logarithmic correlations. For example the statistical mechanics of systems in 
which the energy is given by a random Fourier series, similar to Q was addressed in Fyodorov 
& Bouchaud 2008 and Fyodorov, Le Doussal, & Rosso 2009. In the statistical mechanical 
problem there has been a particular focus on the freezing transition which dominates the low 
temperature limit and determines the extreme value statistics. We shall argue that a similar 
freezing transition determines the extreme value statistics of the characteristic polynomials and 
hence, conjecturally, of |£(l/2 + zi)|, and that therefore the logarithmic correlations exhibited 
by Vjy (0) and V^{x) play an important role. Explaining this observation represents our 
main objective. The implications are wide-ranging, in that they allow us to predict explicit 
formulae for the extreme value statistics. It should be emphasized that we see this as the 
first steps in exploring the connections, and that we are aware of the speculative nature of 
many of the predictions we shall make. We see it as a major challenge to explore these ideas 
more rigorously. We put forward these speculations in the hope that they will stimulate new 
directions of research into the long-standing problems we address. 

The structure of this paper is as follows. We summarize our predictions in the next 
section, and also briefly discuss preliminary numerical evidence for ^(1/2 + it) and random 
matrices in support of them. Some of these numerical results were first outlined in our short 
communication Fyodorov, Hiary & Keating 2012, which we refer to below as FHK. As the 
statistical mechanical problems we shall compare to play a central role in this story, and as 
they may be unfamiliar in the context of the connections between random matrix theory and 
number theory, we give an overview of the key concepts and main results in Section 3. This 
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review of the literature is necessarily lengthy because the intuitions coming from statistical 
mechanics are essential to explain our ideas. In Section 4 we show how to employ the statistical 
mechanics methodology in the context of the extreme value statistics of the characteristic 
polynomials of random unitary (CUE) matrices. Finally, in Section 5 we outline brielfy how 
these calculations may be used to motivate predictions for the extreme value statistics for the 
Riemann zeta-function. 



2. Summary of predictions 

As explained in the introduction, we see our main goal in this paper as being to explain 
the analogies between the statistical mechanics of certain disordered systems, in particular 
the freezing transition observed there, and the extreme value statistics of the characteristic 
polynomials of large random unitary matrices and the Riemann zeta-function. These analogies 
lead to a range of predictions, a representative selection of which we list here. These predictions 
follow from the calculations outlined in Sections 4 and 5, and are motivated by similar 
calculations in Statistical Mechanics reviewed in Section 3. 



2.1. Statistics of the moments of the modulus of CUE characteristic polynomials 

Let us define the moments Z^ifi\L) as in Q, and concentrate for simplicity on the case 
L = 2ir. Then for < /3 < 1 and N S> 1 the probability density of the random variable 
z = ZnW; 2tt)/Z £ (P, 2vr), where 



is given by 

Viz) = e~ zW , z < iV 1 ^ 2 -> oo (12) 



with r(x) and G(x) denoting, respectively, the Euler Gamma-function and the Barnes double- 
Gamma functions (which satisfies G(x + 1) = T(x)G(x),G(l) = 1). In the region (3 > 1 
the probability density of the moments is conjectured to change to a much more complicated 
distribution. Defining the scaled moments for all /3 > 1 as z = Zn((3;2it) ^j^—, the most 
salient feature of Viz) for N S> 1 is predicted to be the following tail: 



Viz) oc z hxz, z > 1. (13) 

Both the change of the tail exponent from 1 + -jp to 1 + ^ as well as the presence of the 
logarithmic factor In z in (13) are different manifestations of the freezing transition occuring 
at (3 = 1. They are expected to be universal features for all values of L in the range < L < 2n. 



2.2. Freezing of the mean Free Energy 

Perhaps the simplest consequence of freezing manifests itself in the temperature dependence 
of the free energy. For ease of presentation, we again focus on the case when L = 2tt. Let us 
define the normalized free energy by 

^ ) = "^IoiiV 10gZ7v(/3 ' 27r) - (14) 
When (3 is small, the average of F{f3) with respect to Un £ U{N) is dominated as N — > oo 
by typical values taken by Pn(@), those governed by the central limit theorem (Keating 
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& Snaith 2000). We have seen above that the typical scale for Z^{f3, 2tt) in that case is 
Z e (/3,2vr) = N l+ P\ and so we expect that 

as) 



One the other hand, from (10) we know that for /3 — > oo the free energy is dominated 
by the extreme values taken by pn(6). The latter will be shown below to scale as 
maxg g [ 0j 27r) l°g \pn(8)\ ~ 21ogiV. We therefore expect that as N — > oo 

-E{^(/3)}^2. (16) 

Freezing in this conext simply means that the transition between the two types of 
behaviour occurs at (3 = 1 and is sharp: in the limit N — > oo 

-E{F((3)} = { / + ^ ^ (17) 

Specifically, the term is motivated, after interpreting as the temperature T, by the fact 
that the free energy remains T— independent, i.e. "frozen", below the critical temperature 
T = 1. 

2.3. Statistics of the maximum of the modulus of CUE characteristic polynomials 

The analogy we develop suggests that the maximal value of the modulus of a CUE 
characteristic polynomial Pn{&) in & n interval £ [0, L), < L < 2tt (which contains on 
average Nl = N^- eigenvalues of the associated matrix f/jy) can be written in the limit 
Nl — > oo as 

- 2 max log \pn(0)\ ~ cl Nl + b NL x, (18) 
0e[o,L) 

where a^ L = — 21ogiV£ + clog log iV/, + o(l), with, conjecturally, c = | and &at l = 
1 + 0(1/ log JVi), and where the random variable x is distributed with a probability density 
p(x). The value of c, |, is significant because it is different from the value, h> characterizing 
the extrema of short-range correlated random variables; see the detailed discussion around 



equation (57). The nature of the variable x and the form of the density p(x) both depend on 



the arclength L, and are understood presently only for two specific choices, as detailed below. 

(i) The full-circle case L = 2tt when iV^ = N. Then the random variable x is of order unity 
and its probability density is predicted to be given by 

p(x) = -«4(x) = - J- [le^K^e*' 2 )] = 2e*K (2e*/ 2 ), (19) 

where K v {z) denotes the modified Bessel function of the second kind. 
The function gp c (x) has the following expansion when x — > —oo 

gp c (x)(x) = 1 + e x (x - 1 + 2 1E ) + e 2x { l -x - | + lE ) + .. , (20) 
where 7_e is Euler's constant. 
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(ii) A mesoscopic interval of small arclength L <C 2ir such that Nl 3> 1, so it typically 
contains many zeroes of the characteristic polynomial. In this case we have x = 
uy/—2lnL+y, where u is a standard (mean zero, unit variance) gaussian random variable, 
and y is independent of u and is of order of unity. The probability density function for 
y can be written again as p(y) = —g'p c (y), but g'p c {y) is now given in terms of a contour 
integral: 



9Pc(v) 



' — / e- sy M(s)T(s-l)ds 
2nr J 



(21) 



where the contour is parallel to the imaginary axis s = sq + iu), with sq sufficiently large 
that all singularities lie to the left, and M(s) can be expressed in terms of the Barnes 
function G(x) 



M(s) 



22s 2 +s- 



G(5/2) 2 7r*- 1 r(s)r(s + 2) 



G(s) 



(22) 



The function can be evaluated numerically (Fyodorov , Le Doussal, & Rosso 2009), and 
the cumulants for the random variable y can be evaluated to be < y >= = 



2j E - ln(2vr), < y 2 > c = E{x 2 - < x > 2 } 



4tt 2 
3 



27 

1 - 



and for general n > 3 



< y n >c = (_)™-i( n _i)!( C ( n _i)(2«-4)-C(n)(2 n 3-4)+2 n+1 -l-2" n )(23) 



It is instructive to compare the behaviour of g{y) at y — > — oo with that predicted for the 



full circle (20). We have 



g(y) = 1 + (y + A!)e* + {A + By + Cy 2 



6 



y 3 )e 2 y + 



(24) 



with A' = 2-i E + ln(2vr) - 1 and C = -0.253846, 
Significantly, we see the same asymptotic tail g(y) — 1 ~ 



B = 1.25388, A = -5.09728. 
ye y shared by the two functions, 
but that the higher order terms in (20) and (24) differ. The asymptotic behaviour 
p(x — > — oo) — xe x + . . . is conjectured to be the universal backward tail shared 
by extreme value distribution of all logarithmically-correlated random functions, see 
Carpentier Sz Le Doussal 2001. 



The significance of these results is that they differ from the usual Gumbel distribution, 
which holds for i.i.d. random variables with finite moments (see Section 3). This difference 
is essentially due to the logarithmic correlations exhibited by vj^ (9) and discussed in the 
introduction: had the correlations been short-range (see Section 3), the Gumbel distribution 
would have applied. This difference shows up in the form of the distributions p(x), and in 
particular in the asymptotic decay of p(x) as x — > — oo, but also, importantly, in the value 



of c in (18). Our prediction, c = % which is conjectured to be another universal feature of 



logarithmically-correlated processes, see Fyodorov, Le-Doussal &: Rosso 2012, differs from that, 
^, which would be expected when the correlations are short-range (or absent). 



2.4- High points of characteristic CUE polynomials 



It follows from (18) that the typical value of the maximum of |pat(#)| in the interval 6 E [0,L] 



is of the order of iV^. The simplest quantity that quantifies the structure associated with the 
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high values of \pn(0)\ is the relative length /j,n(x;L) (as a fraction of the total length L) of 
those intervals in [0,L] where \pn(8)\ > A|J, where < x < 1. This can be expressed as 

fi N (x;L) = - [ L X {21og\ PN (6)\ - 2xlog N L }d6, (25) 
L Jo 

where the characteristic function x{ u } = 1 if u > an d zero otherwise. In the language of 



the theory of random processes, quantities similar to (25) are known as sojourn times of the 
random function 2 log |pjv(0)| above the level 2x log Nl. Explicit expressions for the probability 
density of [i = (j,n(x; L) can be provided again in the two limiting cases: 

(i) The full-circle case L = 2tt, when Nl = N. We denote the typical value fJ, e (x) of the 
length fi N (x;L) by 



■2 



^- r 'V ^^j^ - 0<x<1 (26) 

The probability density for the variable £ = hn(x] L)//j, e (x) is then predicted to have the 
following form: 

7>(f) = i_^- 1 -^ e -r^ ) 0<x<l (27) 
x z 

Note that the mean value of the length E {/j,^(x)} = u e (x)T(l — x 2 ) stays finite as x — > 1. 
A direct calculation shows that such an expression for the mean is valid for any x > 0, 
without restricting to x < 1. However, when approaching x = 1 the mean value is 
significantly larger than the typical value u e (x), and is dominated by rare fluctuations. 
This is directly related to the fact that x = 1 is, to leading order, the typical value 
for the highest maximum of \pn(&)\ (the "extreme value region") so the statistics of the 
corresponding length is indeed dominated by rare events. As will be explained, it is such 
a difference between the mean and typical values which helps us to conjecture c = 3/2 in 



(18). 



The nontrivial leading order scaling with N x in (26) is directly related to the 



multifractal-type structure of the measure of intervals supporting high values. The 



formula (27) is expected to be valid for all £ of order of unity, more precisely as long 
as £ -C £ c , with a certain cutoff scale £ c — > oo as N — > oo, the specific form of which is, 
as yet, unknown to us. 

For a mesoscopic interval L such that 1 -C <C N the random variable hn(x;L) 
is distributed as the product of two statistically independent factors: (in(x;L) = 
e xuy, ^ 2lnL fli^(x), with the random variable u being a standard mean-zero unit- variance 



Gaussian. The variable £in(x) has a typical scale f2 e (%) related to (26) by fi e {x 



2 f^eix)) thus sharing the same multifractal scaling of the length of intervals 
supporting high values. The probability density V{£) of the random variable £ = 
fiN{x)/ fj, e (x) is expected to share the powerlaw tail V{£) ~ £ for 1 <C £ <C £ c — > oo 

with the full-circle case, but the exact shape of the distribution will be different. 

Explicitly, let us define M x (s) = E f° r complex s, at fixed < x < 1. Then the 

density V(£) can be written as the contour integral 

V ^) = T2^I Z S M x (s)ds. (28) 

? J Res=const 
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Note that for the full-circle case M x (s) = T (l — x 2 (l — s)) , so that performing the 



integral ( 28 ) by the sum over residues reproduces (|27j) . 
For the mesoscopic case we have: 

T(l + x 2 (s - l))G x (f + i + xs)Gp{^ + xs)G x (^ + £ + xs) 



M x (s) = A x (s) 



G x {x + l+xs) [G x {\+xs) 



(29) 



with 



A x (s) = 2 ( s - 1 )( 2 + :r2 (^+ 1 )) 7 r 1 - s + x )] G ( 2x + x) (3 ) 

^a:(T + i^ClS + x ) G x(~f + 25 ) 

Here r2(z|x) = G x (z) is the Barnes' double T— function (Barnes 1901): for ?R.(z) > 
logG J! (z) = I ~ 2 Q/2 h(2 T ) (31) 



/o t \ v (l-e-/ ? *)(l-e-'/ /3 ) 2 
where Q = x + 1/x. This function satisfies: 

= G 1/X (z), G x (z + x) = x 1 I 2 - xz (2t:)^T(xz) G x (z). (32) 
For x = 1 the function G x (z) coincides with the standard Barnes function G(z) discussed 



after (12). Like the standard Barnes function, G x (z) has no poles and only zeros, and 
these are located at z = —nx — m/x, n,m = 0,1,... We note in passing that G x (z) 
plays a fundamental role in the Liouville model of Quantum Gravity, see e.g. Fateev, 
Zamolodchikov & Zamolodchikov 2000. 

2.5. Extreme values of ((1/2 + it) 

The relationship between the values of |£(l/2 + zi)|, as t varies along the critical line, and those 
taken by |pat(0)| when the corresponding matrix Un is chosen uniformly at random from the 
unitary group U(N) was first considered in Keating & Snaith 2000, where it was argued that, 
statistically, ((1/2 + it) behaves like the characteristic polynomial of a random unitary matrix 
of dimension N ~ log j-. It has since been the subject of a number of studies (see, e.g., Conrey 
et al. 2003, Conrey et al. 2005, Gonek, Hughes & Keating 2007, Conrey et al. 2008), but we 
are still far from a complete understanding. For example, the role played by arithmetic is 
still being elucidated, although the hybrid model of Gonek, Hughes & Keating 2007 suggests 
that at leading order this contribution decouples from the random matrix component. For 
this reason, the way in which the random-matrix predictions listed above model the extreme 
value statistics of ((1/2 + it) is not entirely clear to us. Nevertheless, we believe that formulae 
at least similar to those listed below should hold at leading order. We give our reasons for 
believing this in Section 5. This belief is also supported by preliminary numerical experiments, 
the results of which we present below. 

In the light of the understanding that ((1/2+it) behaves like the characteristic polynomial 
of a random unitary matrix of dimension N ~ log it is natural to expect, approximately, a 
single matrix to model in a statistical sense the zeta function over a range T < t < T + 2tt, as 
such a range contains log J- zeros on average. One can thus consider splitting the critical line 
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into ranges of length 2ir, and modeling each by a different unitary matrix (c.f. Farmer, Gonek 
& Hughes 2007). In each range one can then find the maximum value taken by |C(l/2 + it)\, 
and finally one can consider the distribution of these maximum values for all of the ranges. 
More generally, one can consider ranges T < t < T + L, where L < 2n. 
Thus if 



Cmax(-£') T) 



max |C(l/2 + it)|, 

T<t<T+L 



(33) 



where < L < 2ir, then we can anticipate that log£ max (L;T) is given by (18), with \pn(8)\ 
replaced by ((1/2 + it), the maximum replaced by (33), Nl replaced by ^ log and where 
p(x) is given approximately by the formulae above when L = 2ir and when L « 2ir, 
respectively. Specifically, we expect 



21ogCmax(£;T) ~ -2 log 



*L log L\+ c i og log 



where the random variable x has a value distribution p(x) that is given, when L « 2ir, in 
terms of (21), and, when L = 2tt, that is approximated (because characteristic polynomials 
are 2-7r-periodic, unlike C(l/2 + it)) by (19). This has two significant implications: that this 
formula holds with c = | , rather than c = | , as would be the case if the zeta correlations were 
short-range, and that the tail of the distribution decays like \x\e x as x —> — oo. It is not at this 
stage completely clear to us how, if at all, the arithmetic will modify these expressions, but 
there are reasons, discussed in Section 5, to believe that it will not influence them at leading 
order. 

Furthermore, we expect, with the same identifications, 

1 meas T < t < T + L : 2 log |C(l/2 + it)\ > 2xlog 



Ht(x;L) 



L 



V2^ & 2ir) 



(35) 



to be given by the corresponding expressions listed above. In this case we do expect the scale 



(26) to be multiplied by the arithmetical factor 



a(x) = Yl 



1 



E 

m=0 



T(x + m) 
m\T(x) 



P 



(36) 



that appears in the moment conjectures at leading order (Keating & Snaith 2000). Specifically, 
we expect the value distribution of /j,t(x;L) to be given by the formulae in Section 2.3, but 
with jl e (x) replaced by 

- + 1 (37) 



7rloglog^2xG(l + 2x)r(l 



Finally, we expect to see freezing of the quantity corresponding to the free energy; that 
is, defining 

(■T+2-k 

\Cil/2 + it)\ 2 ^dt (38) 



and 



P log log ^ 



then we expect that the mean of j^(/3) with respect to T satisfies 



j3 < 1 
P > 1 



(39) 



(40) 
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in the limit as T — > oo. 

It is worth remarking that our results relate to extreme values over much shorter ranges 
than those considered by Farmer, Gonek & Hughes 2007 - our focus is on ranges of lengths that 
are 0(1), whereas theirs was on ranges of length T as T — > oo. If one extrapolates (18) to 
ranges of length T (well beyond where we can justify it), our result for the typical scale of the 
extreme values agrees with theirs. Where we are able to go further is in predicting the value 
distribution of the fluctuations. In the context of the question of the extreme values for t < T, 
it is worth noting again that the tail of the distribution we predict for much shorter ranges 
decays like \x\e x as x — > — oo; that is, the exponential is linear rather than quadratic. If this 
were to persist to much longer ranges than we understand at present, it would suggest that 
C(l/2 + it) may take much larger values than the Montgomery heuristic (or, more precisely, 
the Farmer-Gonek-Hughes conjecture) predicts, maybe even close to the upper limit @, but 
there are several reasons for thinking this unlikely. Specifically, it is not difficult to see that in 
the absence of correlations the large deviation tail of the extreme value density again becomes 
quadratic, and we believe the same is also true when log-correlations are present. 



2.6. Numerical experiments 

In order to test the extreme value predictions for the Riemann zeta function we now 
summarize the results of preliminary numerical computations performed by Dr Ghaith Hiary 
and published in our short communication FHK. These involved evaluating £(1/2 + it) over 
ranges of length 2tt, at various heights T, and finding the maximum value £ max (27r;T) in 
each range. Values of C(l/2 + it) were computed using the amortized-complexity algorithm 
Hiary 2011, which is suitable for computing £(1/2 + it) at many points. Point-wise values of 
£(1/2 + it) that were computed are typically accurate to within ±5 x 10 — , which is sufficient 
for the purposes of this experiment. The maximum of |£ (1/2 + it) | between consecutive zeros 
was computed to within ±10~ 9 . 



The first test concerns the value of the constant c in (34). We expect the logarithmic 
correlations to lead to c = |, rather than c = ^, as would be the case if the zeta 
correlations were short-range. The mean of £ max (27r;T) suggested by the model in (34) is 
S = e 7B iV/(log iV)5, with c = 1/2 or 3/2 , and j E = 0.57721 . . . , where we set N to be the 
nearest integer to logT. At each height a sample that spans ~ 10 7 zeros is used yielding 
~ 10 7 /N sample points (since there are roughly N zeros in each range of length 27r). The 
numerics presented in FHK and reproduced below clearly shows that c = 3/2 fits the data 
considerably better than c = 1/2, thus supporting the logarithmic correlations model. 



Table 1. Ratio of data mean 8 to model mean 8 with c = 3/2 and c = 1/2. 



T 


N 


(s/s) 

V /c=3/2 


(s/s) 

V /c=l/2 


1Q 22 


51 


1.001343 


0.504993 


10 19 


44 


0.992672 


0.510293 


10 15 


35 


0.976830 


0.518057 


3.6 x 10 7 


17 


0.930533 


0.552856 



Testing the distribution p(x) is more difficult, because the data converge extremely 
slowly at that scale. The results of some initial experiments were presented in FHK and 
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are reproduced below. Specifically, we considered 

a(T) = -2 log |C max (27r; T) | + 2 log log ^ - \ log log log ~ (41) 

based on a set of approximately 2.5 x 10 zeros near T = 10 28 . The data were normalized 
so that <j(T) has empirical variance = f x 2 p(x) dx = 3.28986813.... The overall agreement 
was supportive of (19), especially in the important tail when x — > — oo and in view of the fact 
that lower order arithmetical terms Keating & Snaith 2000, Conrey et al. 2005 had not been 
incorporated, but it cannot be said to be conclusive at this stage. The behaviour in the tail is 
significant because if it were to persist into the large deviation regime it would suggest that the 
Montgomery heuristic (or, more precisely, the Farmer-Gonek-Hughes conjecture) significantly 
underestimates the maximum values achieved by |C(l/2 + however, as noted above, there 
are strong reasons for believing that it does not persist this far. 




Figure 1. Numerical computation (solid red line) compared to theoretical prediction 



(191 (dashed black line) forp(x). 



Finally, we performed a test of the prediction relating to freezing in the mean free 
energy. This involved calculating Zt{P) and hence the free energy J r ((f3) numerically, and 
then averaging with respect to T over 10 6 values near to T = 10 28 . If freezing is operative, 
— 7^(/3) is expected to be equal to j3 + 1/(3 for f3 < (3 C = 1 and remain frozen to — J^(/3) = 2 for 
all f3 > 1. In order to account for the finite height at which the computations could be carried 
out, it was found to be efficacious to normalize -Zr(/3) so as to incorporate lower terms in the 
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full asymptotic expansion of the moments Conrey et al. 2005, Conrey et al. 2008 rather than 
just the leading-order asymptotic term. Specifically, what was computed was the T-average of 

DtW) =P+ l R + R , \ T log ( - T Z l { ^ T \ \ - ( 42 ) 
P p log log ^ \\og^P^ozh)) 

where Pp{x) denotes the moment polynomial considered in Conrey et al. 2005, Conrey et al. 
2008 (and its extension as an infinite series when /3 takes non-integer values). The results 
shown in Figure [2] would appear to support freezing. 




0.5 



1.0 



1.5 



2.0 



Figure 2. Numerical computation (red dots) compared to the theoretical prediction 
(dashed black line) for Dt{P), suggesting freezing beyond = 1 

Mr Timothee Wintz also assisted us by performing similar numerical experiments on 
randomly generated unitary matrices. Specifically, he computed the maximum values of 
characteristic polynomials of large numbers of unitary matrices drawn uniformly from the 
CUE. He tested the value of c in (18) with the results set out in Table [2j obtained as for the 



zeta- function. There is again good agreement with our conjectured value c = |. He also tested 
the distribution of the random variable x. In this case it proved efficacious to rescale x by a 



factor 6jy = 1 — B/logN, cf. (18), where the constant B was obtained from a best fit. The 



results for a sample of a million matrices with N = 50 are shown in Figure |3j 
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Table 2. Numerical estimation of c in (18 1 for matrices of size N 



AT 

iv 


c 


zu 




30 


1.46107 


40 


1.48018 


50 


1.49072 


60 


1.49890 


70 


1.50756 



0.40 



0.35 



0.30 



0.25 



0.20 



0.15 



0.10 - 



0.05 



0.00 




10 -8 



-6 -4 



-2 



2 4 



Figure 3. Numerical computation (red crosses) for 10 6 matrices with n = 50 compared 



to theoretical prediction ( 19 ) (blue line) for p(x) 



3. Statistical Mechanics in Disordered Landscapes and Extreme Value 
Statistics 



The idea of complicated energy landscapes pervades the theoretical description of glasses, 
disordered systems, proteins, etc., see Wales 1994, and has recently re-emerged in string theory 
and cosmology, see e.g. Susskind 2005. In this respect, the Parisi solution for mean-field 
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spin-glasses is especially important: it reveals that in that case the energy landscape of a 
system of many randomly interacting spins has a surprisingly complex, hierarchical structure 
of valleys within valleys within valleys (for a short recent account see e.g. Parisi 2009 and 
references therein). Such a structure manifests itself in both dynamics and thermodynamics 
via a non-trivial phase transition occuring at some finite temperature T c > 0. Below T c , 
dynamics associated with wandering in this maze of valleys is non-ergodic and shows many 
distinguished features like aging, Bouchaud 1992. Such features are commonly observed in 
real experiments, although the extent to which the Parisi theory describes energy landscapes 
typical for finite-dimensional disordered systems is still a matter of debate. 

Investigating the energy landscape of a real interacting disordered or complex system is 
a notoriously difficult problem, and an important role is played by studying effective single- 
particle counterparts. Here the main goal is to describe the behaviour of the whole complex 
system, or one of its subparts, by focussing on the statistical mechanics of a single point 
particle (or sometimes higher dimensional objects like lines or membranes) moving in a random 
potential, which encodes the complexity of the original system. The hope then is to be able 
to classify the possible types of landscapes and to establish generic, universal properties, not 
unlike those emerging in Random Matrix Theory. The most famous models of this type are the 
Random Energy Model (REM or GREM, Derrida 1981, Derrida 1985) and its later ramification 
describing the model of a polymer on a tree with a disordered potential (Derrida Sz Spohn 1988). 

Recall that the equilibrium statistical mechanics for a system characterized by a (discrete) 
set of energies E\ , . . . , Em and a temperature T > is represented by the set of Boltzman- 
Gibbs probability weights, pi, ■ ■ ■ ,Pm, where 



(3 = 1/T, and we set the Boltzmann constant ks to unity to measure the temperature 
and the energy in the same units. It is clear that at low temperatures, |3 > 1, the set of 
probabilities is dominated by the lowest available energies in the set. It is then not surprising 
that one of the most fundamental questions arising in the landscape paradigm is the problem 
of understanding the statistical properties of low, or even "extreme" (i.e. minimal) energy 
values typical for various classes of disordered landscapes, see Bouchaud J. P. & Mezard M. 
1997; Biroli, Bouchaud, & Potters 2007. Such an understanding is certainly needed for a 
detailed description of the freezing phenomena in systems with disorder, with the spin-glass- 
like arrest being the paradigmatic example. From that angle, the analysis of the statistics of 
minimal-energy configurations of various random systems is attracting a good deal of attention 
at present; for a review of some recent developments related to Tracy- Widom type statistics 
see e.g Dotsenko 2011. 

In mathematics, the distribution of the minimum/maximum V m i n in a sequence ofM > 1 
random variables V±, . . . , Vm is an important research area with numerous applications. The 
classical results in this area can be found, e.g. in Leadbetter, Lindgren, & Rootzen 1982, 
and we attempt to summarize the facts most pertinent to our present study below. If V{ 
are i.i.d. random variables there are only three possible shapes (up to shifts and rescaling) 
of the limiting distribution <&(V) = Prob(V m i n > V) of the minimum Vmin' In particular, 
for i.i.d. variables V{ with all moments finite the relevant distribution has a characteristic 
double-exponential form and is known as the Gumbel distribution. More precisely, there exist 
non-random sequences ajf and bu such that the random variable y = {V m in — au)I^M is 
characterised when M — > oo by the limiting distribution $>(y) = exp{— e y }. In the particular 




i=l 



(43) 
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case of i.i.d. normal, mean zero variables V, with variance E{I^ 2 } = a < oo the sequences 
o-Mi^m behave asymptotically like a a/ ~ — y/2a log M, bM ~ \J ° '/ (2 log M). 

An important general question, relevant for applications, is to what extent, if at all, 
the above picture holds for correlated random sequences. The most complete answer is 
known for Gaussian mean-zero stationary sequences with covariance E{V^Vj} = C(\i — j\). 
It turns out that if C(r) decays to zero faster than 1/logr the limiting distribution of the 
minima is still given by the Gumbel distribution, and the leading-order scaling behaviour for 
a M,bM is the same as for uncorrelated sequences. We will call such variables short-range 
correlated. Not much is known at present beyond the short-range correlated case. There 
has been particular interest in scale-invariant sequences with stationary increments, also 
known as Fractional Random Walks, which have numerous applications. Such sequences are 
conveniently characterized by structure functions E{(Vi — Vj) 2 } oc \i — j\ 2H , with parameter 
< H < 1 known as the Hurst exponent. In particular, scale-invariance implies that the 
typical minimum in such a case should scale for large M as V m in ~ —C(H)M H which 
should be contrasted with the short-range scaling V m in ~ —\J2o\ogM . The probability 
distribution of the minimum is known explicitly only for the case of the standard diffusive 
random walk H = 1/2, see Majumdar and Comtet 2004, when one can exploit path-integral 
methods based on the underlying Markovian structure. Characterizing extreme value statistics 
for non-Markovian random walks H ^ 1/2 remains a considerable challenge; even the constant 
C(H) is not yet known explicitly beyond H = 1/2. 



3.1. Random Energy Model and the two-dimensional Gaussian Free Field 

The results reviewed above can already be used to generate some insight into the equilibrium 
statistical mechanics of a single particle in a disordered landscape. To this end let us 
identify E% o Vi and look at the sequence Vi as representing a set of energies available 
for a particle at various "sites" i = 1,...,M of a disordered system. Recall that the free 
energy F{j3) = — /3 _1 log Z(f3) can be represented as F((3) = U — TS, where U = (Ei) T is 
the mean energy of the system, with standing for the thermal average with respect to 



the Boltzmann-Gibbs measure (43). The quantity S stands for the entropy, which effectively 
controls how the available mean energy is spread over all the available energy levels (i.e. 
over the sites). In particular, the entropy of a system in which all M states are equally 
likely is given, according to the Boltzmann formula, by S = log M. One may then attempt to 
understand the structure of a Boltzmann-Gibbs measure at a crude qualitative level by invoking 
the argument of a competition between the entropic term TS = TlogM and the minimal 
available energy, see e.g. Carpentier & Le Doussal 2001. Recalling that V m i n « —^f2o\ogM 
for short-ranged random sequences, we immediately conclude that for any fixed T > the 
entropic contribution will dominate over the energetic component in sufficiently large systems. 
This should result in a Boltzman-Gibbs measure being spread more or less democratically 
over all the sites of the landscape. The normalisation ^ffiPi = ^ then implies the scaling 
Pi ~ M . In such a situation it is conventional to say that for any T > the system stays in 
the high-temperature phase with a delocalised Boltzman-Gibbs measure. For the long-range 
correlated case, however, the situation is somewhat the opposite: for any temperature T < oo 
and Hurst exponent H > the magnitude of the minimal energy for large enough M grows 
faster than the entropic contribution. In this situation the Boltzmann-Gibbs measure for large 
enough systems will be essentially localised on one or a few sites of minimal energy, with the 
corresponding pi = 0(1), whereas for the majority of the sites pi is expected to be negligible. 
It is conventional to say that effectively such a system is frozen in the low-temperature phase, 
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for all temperatures. 

It is possible to augment the picture just described in several ways. The simplest 
is to rescale the variance a of the short-ranged random potential with M in such a way 
that a ~ logM. We can thus ensure that both the minimal energy and the entropy 
grow logarithmically with M, and therefore which of the two dominates will depend on the 
temperature T. If in addition we assume the random energies to be i.i.d. random variables, the 
resulting model is precisely the much-studied Random Energy Model (REM), introduced by 
Derrida, see Derrida 1981,1985, which is sufficiently simple to be amenable to rigorous analysis 
and has played a paradigmatic role in the statistical mechanics of disordered systems. In 
particular, the model displays a nontrivial "freezing" phase transition at a finite temperature 
T c , which by an appropriate choice of the variance can be made equal to unity. Namely, 
the mean free energy of the system in the limit M — > oo behaves in the high-temperature 
phase T > T c = 1 as E{F(/3)} « - (j3 log M and "freezes" to the minimal value 

E{F(/3)} « — 21ogM for all temperatures below the transition, for < T < 1. 

In fact, both above and below T c the Boltzmann-Gibbs probability measure is neither truly 
localised nor delocalised, but rather provides the simplest example of a random multifractal 
measure. Namely, in the thermodynamic limit M — > oo the weights pi scale differently on 
different sites: pi ~ M~ ai , with the exponents oti filling some interval [ot- , a+] in such a way 
that Yli aj) ~ M^ a \ with a well-defined smooth concave function /(a), see e.g. Fyodorov 
2010; Fyodorov, Le Doussal & Rosso 2012 for a more extensive discussion and further references. 
A detailed analysis of the low-temperature phase reveals even more intricate probabilistic 
structure: the weights pi for T < T c can be described in terms of Ruelle probability cascades, 
see e.g. Bovier 2008. Physicists usually refer to the low-temperature structure arising as one 
reflecting the simplest nontrivial mechanism of a spin-glass type phase transition - the so-called 
one-step spontaneous replica symmetry breaking. A ramified version of the same mechanism 
in more sophisticated spin glass models gives rise to the Parisi picture of hierarchical valleys 
for the effective (free) energy landscape mentioned above. 

Despite the success of Derrida's idea to rescale the variance of the i.i.d. variables with 
log M, inspired by a similar scaling in infinite-dimensional mean-field spin-glass models, such 
a procedure for generating a nontrivial freezing transition looks somewhat artificial from the 
point of view of random landscapes over finite-dimensional lattices. It is therefore worth noting 
that a similar scaling of the variance arises naturally in certain finite-dimensional models of 
physical interest. Chamon, Mudry, k, Wen 1996 revealed how the corresponding landscape 
model emerges in the case of a quantum Dirac particle moving in the two-dimensional plane 
subject to a transverse random magnetic field. They noticed that the profile of a (normalized) 
eigenfunction corresponding to zero energy of the Dirac Hamiltonian has, formally, the shape 
of a (continuous space version of) the Boltzmann-Gibbs measure on x = (x,y) £ M 2 , 
with the role of the random potential V(x) played by the two-dimensional Gaussian Free 
Field (2dGFF). The latter is defined as a mean-zero random Gaussian field in a domain 
D G R 2 such that its covariance is given by E{V (x)V '(x')} = G(x, x'), where G(x, x') is 
the Green function of the Laplace operator A = d 2 + <9 2 in D, with specified conditions on 
the boundary dD. Let us consider, for definiteness, Dirichlet boundary conditions and take 
D to be a two-dimensional disk \z\ < L, where we have employed the complex coordinate 
z = x + iy. The corresponding Green function is then G(zi,Z2) = ~^^°Sjjz^^~- I n 
particular, for any two distinct points z\ and z^ well inside the disk \z\p\ « I we find 
G(z!,z 2 ) = -^log^x^, the latter expression interpreted as the Green function on the 
full two-dimensional plane. To make the construction well-defined from the point of view 
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of an underlying random Gaussian field, one has to ensure its variance is finite by taking 
appropriate care of the divergence when z\ — > Z2- The most natural way is to think of an 
underlying lattice structure, with the lattice spacing a <C L and the random field defined on 
the lattice sites onljjfj We have M ~ (L/a) 2 lattice points inside the disc, and it is consistent 
to require E{l/(x)F(x')}| x ^ x / = — ^ log £ oc logM showing that the 2dGFF indeed naturally 
gives rise to a REM-like scaling of the variance. In contrast to the REM landscape, however, 
the values of the 2dGFF at different lattice points are strongly (logarithmically) correlated. 
Nevertheless it is natural to conjecture that the freezing transition typical for the REM (and 
shared by the model of directed polymers on disordered trees, Derrida & Spohn 1988) will 
also occur in such a situation, and will give rise to a multifractal structure in the associated 
Boltzmann-Gibbs measure, and hence for the zero-energy wavefunction of the Dirac particle. 
Numerical simulations and further analytical studies confirm the validity of such a picture, see 
Chamon, Mudry, & Wen 1996 and Carpentier & Le Doussal 2001. The latter work was the first to 
provide significant insight into the associated statistics of the minima of the regularized 2dGFF 
landscape. In particular, it suggested a powerful, albeit heuristic, real-space renormalization 
group approach. This approach substantiated the claim of a REM-like freezing scenario in 
logarithmically correlated landscapes, and led to a conjecture for the minimum value of such a 
landscape: V m i n = om + omV, where au = (— 21nM+flnmM+o(l)), b M = l + 0(l/ln(M)) 
and the value distribution of the random variable y is given asymptoticially, as y — > oo, by 
<3?(y) ~ 1 — \y\e y , and so is different from the Gumbel expression, for which $(y) ~ 1 — e y . 

In the mathematical literature the probabilistic properties of the extremes of the lattice 
version of 2DGFF have attracted considerable attention, see Bolthausen, Deuschel Sz Giacomin 
2001; Daviaud 2006; Bramson k, Zeitouni 2012. Actually, the above works addressed V max 
rather than V m i n , but the two statistics are obviously trivially related. In particular, Bramson 
& Zeitouni proved that Vmax = 21ogM — | lnlnM + 0(l). This indeed agrees with the leading 
order terms in om- A more detailed characterisation of the extreme value distribution was 
beyond the reach of the methods used, but the work by Davioud provided key insights into 
the very high, almost extreme values of the 2dGFF. Specifically, let us define N^(x), where 
x £ (0, 1), to be the number of lattice points such that the potential Vi satisfies Vi > 2xlogM. 

It turns out that limA/->oo ^"hiNi ^ = ^ ~ %2 ■ This result implies that the typical value of 
N^(x) scales in every realisation roughly as N^(x) ~ M l ~* 2 ' . Such a behaviour is natural 
to interpret again as a kind of multifractal scaling, not unrelated to the multifractality of the 
Boltzmann-Gibbs weights. Note that when x — > 1 the typical number of points above such 
a level becomes of the order of unity, which agrees with the scaling of the extreme values 
described above. We will return to these issues later on in the paper. 

Finally, we mention that a mathematically rigorous framework for dealing with general 
fields and processes with logarithmic correlations was developed in Kahane 1985 and is known 
by the name "Gaussian Multiplicative Chaos". This has undergone substantial development 
in recent years, see e.g. Bacry & Muzy 2003, and has in particular been exploited in the 
context of probabilistic aspects of Quantum Gravity, see e.g. Duplantier et al. 2012 and 
references therein. Very recently such a framework was shown to be of substantial utility also 
for studying Boltzmann-Gibbs measures associated with 1/f noise landscapes, and closely 
related problems, see Arguin & Zindy 2012 and also Astala et al. 2011 and Barral et al. 2012. 

§ It is possible to give a bona fide mathematical construction of the continuous 2dGFF, sec Sheffield 
2007, but for our goals it suffices to rely upon this heuristic approach. 
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3.2. 1/f-noise as a random landscape I: Statistical Mechanics in the high-temperature 
phase 

To attack the problem from a different angle Fyodorov & Bouchaud 2008 and Fyodorov, Le 
Doussal, & Rosso 2009 suggested using the full-plane logarithmic GFF to construct various 
one- dimensional Gaussian random landscapes with logarithmic correlations. The associated 
Boltzmann-Gibbs measures are expected to be qualitatively analogous to those in 2dGFF 
landscapes, but are amenable to much more detailed quantitative analysis. Arguably the 
simplest example of such a one-dimensional landscape can be generated by sampling the values 
of the full-plane 2dGFF along a circle of unit radius parametrized as z = e lt , t G [0, 2ir). One 
is thus led to a 2tt— periodic mean zero Gaussian process V{t) whose covariance for t\ ^ t2 is 
formally given by 

E{V(h)V(t 2 )} = ~ log \e ih - = - 1 log2| sin ifa - t 2 )\. (44) 

By employing the well-known identity: —log (4 sin 2 tl ~ 2 ~ t2 ) = 2^^^ ^ cos n(t\ — t 2 ) we see 
that one representation of V(t) is given by a random Fourier series of the form 

oo 1 

V(t) = V -= [v n e mt + v n e~ int ] , (45) 
*— ' vvrn 

n=l v 

where the coefficients v n ,v n , n = 1,2..., are i.i.d., mean-zero, complex Gaussian variables 



with variance K{v n v n } = 1. In this way we sample every Fourier series (45) according to 



the Gaussian probability weig 

htfC = i e" 3W, ^l As the 

power associated with a given 

Fourier harmonic with index n decays like 1/n, the function V(t), viewed as a time-dependent 
random signal, is a representative of the so-called 1// noise believed to be ubiquitous in Nature, 
see e.g. Milotti 2002. In the present section we shall focus on this particular model, although 
later on in the paper we will also consider another model of 1// noise, which corresponds to 
sampling the values of the full-plane 2dGFF along a finite interval of a straight line. 



Pretending for the moment that (45) defines a well-behaving function and further 
exploiting the identity 

f-2lT f-2TT 

2tt 2 



i/ 2 "/ 2 ^ 128111 !*^ 2 ) \ eintle ~ imt2dtldt2 = ^\ 5n ^ n^0,m^0 (46) 
we see that in the space of functions V{t) defined by (45) we should have 

5(^) = -— ^ ^ ln\2 S m^^- Y ^j\V'(t 1 )V'(t 2 )dt 1 dt 2 = ^\vn\ 2 >0. (47) 



where we have introduced the formal derivative V'(t) = dV JP ■ The relation (47) then implies 



that the Gaussian weight associated with every Fourier series (45) is proportional to e - ^^) 
so that the expected value for any functional <&(V) of 1// noise could be written symbolically 
as a formal "path integral" 

E{$(V)} = J e -^ s{v) $(V) VV (48) 

much in the same way as the expected value of a functional 3>(W) of the standard 
Brownian motion W{t) with respect to the Wiener measure can be symbolically written 
as fe-^ siw) <Z>(W)VW, with S(W) = f[W'(t)] 2 dt, see e.g. Klauder 2003 for historical 
remarks and further references. The crucial difference however is that in the case of the 
Wiener measure, the evaluation of expected values in many practically interesting cases can 
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be rigorously performed by the Feynman-Kac formula, reducing calculations to the solution of 



a differential equation, whereas a mathematically rigorous theory behind (47)- (48) is not yet 
availabldjj] Nevertheless, we eventually will be able to employ heuristic methods to conjecture 
the explicit value of ( 48 ) in a regularized versions of the theory for the particular functional 



<J?(V) = e p ^> e pv ® with real parameters p > and \/3\ < 1; see (60) below. 



The similarity between (45) and (|8j) is very significant in motivating the analogy we 
wish to draw between the statistical mechanical problem under discussion and the value 
distribution of the characteristic polynomials of CUE matrices, especially in the light of the 



discussion following ([8j) . Unfortunately, the random Fourier series ( 45 ) is pointwise divergent 
with probability one, as reflected in the logarithmic divergence of the covariance (44) when 
t\ —7- t2- To be able to use such a function as a random energy landscape it is therefore 
necessary to provide a regularized version of the process with a well-defined variance^ To this 
end, Fyodorov & Bouchaud 2008 constructed a lattice version of the model, called the circular- 
logarithmic model. In this case one replaces the function V(t), t 6 [0, 2ir) with a sequence of 
M random mean-zero Gaussian variables Vi obtained by sampling the 2dGFF equidistantly 
along the unit circle at points Zj = exp(z^) and then multiplying by a constant factor \/2tt; 

that is Vi = \/2ttV {^jf \ , , i = 1, • • • , M. Such a sequence automatically inherits the M x M 



covariance matrix C km = K{V k V m } from (44) so that the off-diagonal entries are given by 



2-7T 

C Mm = -21og|2sin — (k-m)\. (49) 

To have a well-defined collection of Gaussian-distributed random variables we have to ensure 
the positive definiteness of the covariance matrix by choosing the appropriate diagonal entries 
Ckk- A simple calculation shows that one has to choose 

C kk = E{ V k 2 } = 2 log M + W k , with any W k > 0, Vfc = 1, . . . , M . (50) 

In practice it was further assumed that W k = W, Vfe, and finally the limit W — > was taken. 

The crucial observation made in Fyodorov & Bouchaud 2008 was that in the large-M 
limit the positive integer moments K{Z n ((3)} of the partition function Z(j3) = ^2ie~^ Vi 
corresponding to such a landscape converge in a certain temperature range to the Dyson- 
Morris version of the Selberg Integral, see Forrester & Warnaar 2009. More precisely, for /3 < 1 
and positive integer n we have, asymptotically, that 

f M 1+ ^" 2 0(1) forn>l//3 2 
E ™ = 1m* + «^ for 1<n< V (51) 

where T(x) is the Euler Gamma-function. 

From the moments E{Z n (/3)} Fyodorov & Bouchaud were able to reconstruct the 
probability density V(Z) of the partition function Z(f3) in the high-temperature phase 
< (3 < 1. Actually all the statistical properties of the partition function remain invariant 
when changing (3 — > —(3, so we can formally consider f3 to be of arbitrary sign, and define the 

|| See, however, the investigation of the transformation properties of the quadratic forms associated 



with (47 1 in Vershik, Gelfand, Graev 1973 and Neretin 1983. Those objects turned out to be intimately 
related to the representation theory of the group SL(2, R) and the Virasoro algebra. It would be natural 
to expect that such an invariance may play an important role in building a mathematically rigorous 



theory of path integrals of the type ( 48 ) 



Note that precisely the same process and the associated Boltzmann measure was found recently to 
play an important role in constructing the two-dimensional closed conformally-invariant random curves, 
see Astala et al. 2011 
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high-temperature phase by the condition \/3\ < 1. The density V(Z) in this domain turned 
out to consist of two pieces, the body and the far tail. The body of the distribution has a 
pronounced maximum at Z ~ Z e = M 1+/3 /r(l - /3 2 ) < M 2 and is given explicitly by 

nZ) = ~( Z if Z«M 2 , <1 (52) 



„-i- 



i 



The most important feature of the above distribution is the powerlaw decay V{Z) ~ Z x s 
in the parametrically wide region Z e <C Z <C M 2 . For Z S> M 2 the above expression is 
replaced by a lognormal tail 



V^vr/? 2 logM^ V 21 ogM 
where the unknown function R(x) is of the order of unity for x ~ 0(1). It is easy to check 



that (52) and (53) indeed match at Z ~ M . Let us stress once again that this picture is valid 



only in the "high-temperature" phase |/3| < /3 C = 1. 

3.3. 1/f-noise as a random landscape II: fluctuating multifractal patterns of heights 
and the threshold of extreme values 

In this section we review the arguments from the work of Fyodorov, Le Doussal, & Rosso 2012 



allowing one to exploit the high-temperature moments (51) to analyze the statistics of the 
number of "low" or "high" values in such a sequence. In this way we will be able to determine 
the position of the thresholds V± of the extreme values. For the high values, such a threshold 
V+ is defined as the level above which typically we should have only a few (i.e. of the order 
of one) points of the sequence Vi, . . . , Vm when M S> 1. The definition of V- for low values 
is similar, with "above" replaced by "below". We already have mentioned that at the leading 
order we must have V± = ±21ogM. Correspondingly, we will call the value Vi of the sequence 
x—high provided Vi = 2x log M with < x < 1, and similarly define x—low values as those for 
which — 1 < x < 0. Introducing further the notation hi = e * > the condition Vi > xlnM 
becomes equivalent to hi > M x . In the literature the general sets of values hi associated with 
points of the lattice in such a way that they scale in the large-M limit as hi = M Xi , with 
singularity exponents x% filling in a finite interval are called multifractal sets. Their 

most important characteristic is the so-called singularity spectrum f(x) which characterizes the 
asymptotic growth of the counting function N > (x) ~ of the number of x-high points. 

Thus, counting x— high/low values is intimately related to revealing the multifractal structure 
of 1 // noise, and we give a brief account of the procedure below. This will help in quantifying 
the picture outlined in Section 2.1. 

To that end we define the density pm(v) = Ylt=i ^(Yk ~ ylnM) in terms of which the 
counting function is given by N > (x) = InM pM(y)dy. In the large-M limit the density 
can be described by the following multifractal Ansatz: 

n M {y) MfW y 2 
pM W 2VHXl- y 2 /4 )v dnM' m = 1 " T< |y| < 2 (54) 
where n^iy) is a random coefficient of order of unity which fluctuates strongly from one 
realization of the sequence Vi to another. To understand ( f54| ) we note that pm(v) provides 
a direct link between N > (x) and the partition function Z(j3) = ^2ie~^ Vt , as the latter can 
obviously be expressed in terms of the same function as Z{j3) = InM /* p M {y) dy . 



Here it will be convenient to allow (3 to be of any sign. Substituting the Ansatz (54) into the 
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above formula for Z(/3) we can perform the integral over y in the limit lnM> 1 by the Laplace 
method, with the stationarity condition resulting in the relation y = —2/3. We arrive at the 

asymptotic relation Z((3) rj njv/(y = — 2/3) Z e , with Z e = ■ Note that the condition 

\y\ < 2 translates into |/3| < 1. At the same time we know that Z({3) for |/3| < 1 must be 



distributed according to the density (52). To ensure this property we therefore conclude that 
the probability density of the random variable n = tim (y) for a fixed value of y £ (—2, 2) must 
necessarily be of the form 

4 1 4 

W = 117T^ )? -2<y<2. (55) 

We expect this form of the density to be valid as long as n <C n c , with n c being some cutoff 
value diverging for M — > oo. The precise dependence of n c on M, as well as the shape of V y {n) 
for n S> n c , cannot be extracted from the above arguments, although the internal consistency 



of the Laplace method suggests that (55) cannot hold when n ~ M v when v > 0. 



These facts can be now converted to determine the counting function statistics. Indeed, 



substituting (54) into the integral for the counting function we find by the same method 



N > (x) « um{x) Nt{%) with the same random factor nM{x) distributed according to (55) and 
the typical value Nt(x) being given by 

mx) = SSrp - SV4) 5 E{N > (X » fcnW 0< * <2 ' (56) 

The significance of the above consideration is, in particluar, the fact that it allows one 
to determine the precise position V+ of the typical threshold of extreme values. By its 
very definition the threshold is determined by the condition Mt(x) ~ 1 when M 3> 1. A 
straightforward calculation then allows one to show that 

In InM 

V+ = 2x+ InM, where x+ = 2 - c- , with c = 3/2 (57) 

+ + + InM ' ' v ' 

The same large-M asymptotic must also hold for the position of the absolute maximum V m 

of the sequence which is always among a few values above that threshold. The statistical 

properties of V m will be discussed in more detail in the next section. 

Note finally that had we instead decided to use the condition E {N > (x)} ~ 1 this would 



result again in (57) but with c = 3/2 replaced by c = 1/2. The latter value is indeed known to 
be characteristic of short-ranged correlated random sequences. The difference is due to the fact 
that for such sequences the mean and the typical values of the counting function are always 
of the same order, whereas for log-correlated sequences in the vicinity of the threshold the 
typical value Mt{x) becomes parametrically smaller than the mean value K{N > (x)}. Similar 
behaviour is argued to be shared by a broad class of disorder-dominated multifractal processes 
and fields, see Fyodorov, Le Doussal & Rosso 2012 for a more detailed discussion. 

3.4- 1/f-noise as a random landscape III: duality, freezing and statistics of extremes 

Now we turn our attention to the generating function E |exp(— e^ y Z/Z e )\ which will underpin 
many subsequent calculations. It may be checked easily that the leading-order large-M 



behaviour is dominated by the "body" density (52) rather than by the log-normal tail (53), 
and so we find after straightforward manipulations: 

gp(y) = E{exp(-e^Z/Z e )} = E {exp(-e^-^) J (58) 



J exp |-t - ePvt-Pjdt, < /3 < 1 
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where we have employed the notation <j)p = F(/3) — F e {(3) for the deviation of the free energy 
F = — /3 _1 lnZ(/3) from its typical value in the high-temperature phase 

F e = -/T 1 lnZ e = -(/3 + /T 1 ) InM - /T 1 lnr(l - /3 2 ) (59) 

Note that after identifying M -1 Z(/3) as a regularization for the 1/f noise inte gra l 
2^ fo^ e~@ v ^ dt, (58) can be interpreted as the evaluation of the "path integral" (47)-(48) 
for the functional = e~ v ^ SS * e ~ mt) dt 

identity: 



with p > 0, and in this way is equivalent to the 



dt, z e {p 2 



e 2 



E{V 2 (i)} 



r(i -/3 2 



(60) 



expected to hold when \/3\ < 1 for any regularized version of the 2-7r-periodic Gaussian 1/f 
noise with finite variance E{y 2 (t)} < oo. In particular, let us take the limit p — > oo, j3 — > in 
such a way that pf3 2 = fj, < oo. Note that for the periodic 1/f noise (45) we obviously must 



have Jq W V(t) dt = 0. Using this fact we easily find that, in the limit in question, (60) yields 
the identity: 



E 



{ e -^/o^ 2 Wdtj =e -K(o) r(1 + /u) 



(61) 



Ze(P 2 )\ 



(32=0 



^E{y 2 (t)} + r'(l) is a constant depending on the chosen 
In turn, (61) means that the quantity R = j- jQ V V 2 (t)dt (which can be 



d 

dm 



where z' e (0) = 
regularization. 

interpreted as a measure of roughness of the 1/f signal) is Gumbel-distributed: R = z' e (0) + r 
with random r whose probability density is V(r) = 4- exp {— e~ r }. This result was derived 
for the first time in Antal et al. 2001 by a completely different method, and remained, until 
recently, one of only a few explicit results on the statistics of 1// noise. 



The integral on the right-hand side of (58) belongs to a class of special functions that 



enjoyed a detailed investigation in the mathematical literature only a few years ago, see Neretin 
2006. It possesses several non-trivial properties. In particular, it was noticed in Fyodorov, 
Le Doussal, & Rosso 2009 that gp(y) satisfies a remarkable and important duality relation: 
9p(y) = ffi (y) ■ Indeed, after some algebraic manipulations one can rewrite the integral in the 
right-hand side as 



L exp {- 

oo 



(-iy 



n=l 



111 



\dt 

e nf3y T{l - 



1 

2tH 



Rev=e>0 



e~ vy r(l + (3v)T (1 + /3 



dv 



(62) 



n/3 2 



+ e n ?r 



1 



n 

J2 



(63) 



where in the right-hand side formally 1 < 7^ integer, but it is easy to check that in the 
integer the series in fact retains a well defined finite value. 



limit 4j 



Let us again stress that (58) holds only for < /3 < 1 and the duality of the right- 
Instead, there is 
j3 c . We have 



hand side cannot be used to evaluate the left-hand side for f3 > f3 c = 1. 
accumulating evidence that a type of freezing phase transition happens at /3 
already mentioned the simplest instance of that transition at the level of the mean free energy 
which manifests itself (to leading order in M) in the change from the value — (/3 + /3 -1 ) InM for 
/3 < 1 to the temperature-independent value —2 InM for f3 > 1. This freezing of the leading- 
order mean free energy, which follows the pattern of Derrida's uncorrelated REM, was, for the 
present model, rigorously proved recently in Arguin & Zindy 2012. However, Carpentier & Le 
Doussal 2001, followed by Fyodorov & Bouchaud 2008, elevated the idea of freezing far beyond 
the level of the first moment and conjectured it to extend to a much richer freezing scenario: 
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the whole generating function gp{y) 'freezes' to the temperature independent profile g@ = i(y) 
everywhere in the 'glassy' phase /3 > 1. 

This scenario is supported by the following arguments (i) heuristic real-space 
renormalization group calculations (Carpentier & Le Doussal 2001) revealing an analogy to 
the travelling wave analysis of polymers on disordered trees (Derrida & Spohn 1988) where 
such a scenario can be rigorously shown to hold (ii) the duality relation mentioned above can 
be shown to hold also for other types of logarithmic landscapes (Fyodorov, Le Doussal, & Rosso 
2009 and 2010) - in particular, the duality forces the 'temperature flow' of the function gp(x) 
to stop at the critical point /3 = /3 C = 1 - (iii) by relations between the freezing scenario and 
the mechanism of one-step replica symmetry breaking in logarithmic models (Fyodorov , Le 
Doussal, & Rosso 2010, and to be published), and (iv) finally, by direct numerical simulations 
in these papers. All these facts taken together inspire our confidence in the validity of the 
freezing scenario, although it remains a major challenge to prove this conjecture rigorously. 

One of the main consequences of the freezing conjecture is that it implies the possibility of 
obtaining the distribution of the (properly rescaled) free energy in the low-temperature phase. 
Namely, following Fyodorov and Bouchaud, 2008 we make an additional assumption that for 



all j3 > 1 when freezing is operative the expression ( 59 ) for the typical value of the free energy 



F e {j3) should be replaced with the value of the position of the threshold of minimal values 
F e (/3) -)• V- = -21nM + § lnlnM, so that <j>p = F(f3) - V— Introducing again the notation 
p = e@ y and using the fact that for our particular choice of the model gp = i(y) = 2e y l 2 K\(2e y l 2 ) 
where K\{z) denotes the modified Bessel function of the second kind, we see that the freezing 



scenario allows us to rewrite the relation ( 58 ) for all f3 > 1 as 
E jexp 



-pe 



)} 



2p^K x [2p 



(64) 



This formula can be used as a generating function for the random variable = 0s, and can 
be further employed to extract the probability density for that variable in a closed form: 

1 r°° 1 
2^ 



V 



C'LM 



(0) 



r(i + f 



-r 2 (1 + is) ds 



(65) 



d 



i + T 



n=i n\{n 



mil 



n-, 



+ - - 2ip(n + 1) + ( 1 - n- 
n p \ p 



where tp(x) = V (x)/T(x). In particular, as lim^oo [— /3 1 logZ(/3)] = min 



i=l,...,M 



V m i n , one can gain access to the distribution of the minimum of the random potential sequence. 

Vmin + 2 log M — | In In M one finds in the 



p(x) 



2e x/2 K 1 (2e 



2e x K (2e x/2 ) . 



(66) 



Introducing, correspondingly, x = linig^oo 0(/3) 
large-M limit the probability density for the variable x to be 

_d_ r„ 
dx 

Such a distribution is manifestly different from the Gumbel law, which has a probability 
density of the form PGum(x) = — ^ [exp — Be Ay ^ and which holds for short-range correlated 
gaussian random sequences. Moreover, the density p{x) does indeed exhibit the universal 
Carpentier-Le Doussal tail p(x —> — oo) ~ —xe x . One can easily find all cumulants of the 
distribution ( |66| ), see Fyodorov, Le Doussal, & Rosso 2009, e.g. the mean E{x} = —2je, the 
variance [E{x 2 }] c = E{x 2 } - E{x} 2 = ^, etc., [E{x n }] c = (-l) n 2(n - l)\((n) in terms of 
Riemann-zeta values. All this agrees with the available numerics. 
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4. Statistics of extreme and high values of CUE characteristic polynomials. 

The apparent similarity between the circular-logarithmic model discussed in the previous 
section, and the logarithm of the characteristic polynomial of a CUE matrix makes it evident 



that they exemplify two different ways of regularizing the same 2tt— periodic l//-noise (45). 
The latter can be properly defined only as a random generalized function, see Sheffield, 2007. 
The two regularizations are however of a rather different nature: the log-circular model replaces 
1/f noise with a finite sequence of M random variables, whereas the log- modulus of the 
characteristic polynomial for any finite N is a piecewise-continuous random function with 
iV logarithmic singularities in the interval [0, 2ir\. Nevertheless, if the limiting 1/f noise is 
a meaningful object at all, it would be natural to expect the two processes to share the 
same extremal and high/low-level values statistics in the limits logM 3> 1 and logiV S> 1, 
respectively. 

To substantiate this claim we follow the same strategy for characteristic polynomials as 
we did for the log-circular models. Our main object of interest in this section will therefore be 
the moment Q , which is analogous to the partition function discussed in the previous section, 
and in the associated free energy. We shall be interested in various choices of the interval L. 

The first step is to consider the positive integer moments 

E{^(/3;L)}=^^\..^\{| m (0 1 )| 2 ^..|^(^)| 2 / 3 }n^, k = 1,2,... (67) 

where the expectation E{. . .} is with respect to Haar measure on the unitary group U(N). As 
is well known, the expectation value in the integrand is given by a Toeplitz determinant: 



E 



{ M9l )r..| m («| 2S } = ^, 0?OT=det(M<f»)";_ 1 (68) 

L> N (V) 



where 



M (_« = / jWr-j) Yl [ 2 - 2 cos (0 - 6 p )f ft (69) 



2tt 

fMi-3) TT [2 - 2 ens (rh - (9_M^ 

2ir 

The asymptotic behaviour of such a Toeplitz determinant in the limit N — > oo was conjectured 
in Fisher & Hartwig 1968 and proved in Widom 1973: 

- ^(l + fl j* ' 
G(l + 2d) 

r<s 



D (k) 
N 



G(l + 2/3)_ 

where G{x) is the Barnes function. Further progress is possible in two cases. 



4-1. The full circle: L = 2n 



In this case, substituting (70) back to (68) and (67), we see that the resulting expression 
is the standard Dyson-Morris version of the Selberg integral, see Forrester & Warnaar 2007, 
convergent for k < and divergent for larger k. As we have k > 1 the procedure makes sense 

only for j3 2 < 1. Introducing the notation Z e = N 1+ ^ 2 g(i^2/3)r(i-/3 2 ) ' we ^ nc ^ 

E{z k N {P)}=Z k e T{l-kf), k<j 2 . (71) 
To understand how to deal with the case k > jj, we recall that Widom's asymptotic 



formula (70) is valid only when all of the differences \6{ — 6 A remain finite when iV — > oo, 



Freezing Transitions and Extreme Values 



27 



and should be replaced by a different expression when \6i — 9j\ ~ iV _1 . One can check 
that the divergence of the integral for k > 1//3 2 is due precisely to the fact that these near 
degeneracies become important. Relying on our experience with the corresponding situation 
for the circular-logarithmic case suggests that taking into account the correct short-scale cutoff 
cures the formal divergence, but changes the asymptotics of the moments (Z^([3)} with N: 
namely, these become of the order of N 1+k2 ^ 2 for k > (3~ 2 whereas they are of the order 
of N( 1+/3 ) fe for k < (3~ 2 (c.f. the large- iV asymptotics of the moments of the characteristic 
polynomials derived in Keating & Snaith 2000). Such a change of behaviour will lead to a 
log-normal (far) tail in the distribution. 



The above expression (71) for the moments is exactly the same as (51), except that 
the characteristic scale Z e has a different value. This means we can simply translate, 
mutatis mutandis, all results from the regularized circular-logarithmic model to the values 
of characteristic polynomial sampled along the full circle 9 £ (0, 2tt]. 



4-2. A mesoscopic arc: 1 <C Nl 



iVf < N 



Another case when the partition function moments can be explicitly evaluated corresponds to 
an arc of length L that is much smaller than that of the full circle, but still much larger than 
the typical distance between the eigenvalues, 1/N. We will call such an intermediate scale 
mesoscopic. Following the same route as before, in the limit > 1 we have 



1+/?2 G 2 (l + /3) 



N l+0 



G(l + 20) 



o 



L k 



which after expanding the exponents e p 
the form 

k 

£-/3 2 fc(fc-l)+fc 



6U-2/3 2 TT 
r<s j 
1 + i0 p , and further rescaling 6 : 



p 



(72) 
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(73) 



r<s 



Notice that the factor L ^ 2fc2 can be written as E 



e |uV-21nL 



2fSk 



, where the averaging 



is performed over the standard mean-zero gaussian variable u of unit variance. We conclude 
that the characteristic polynomial pn{6) in the interval [0,L] has the same probability law as 



\uyJ-2\nL 



a product of two independent random variables: pn{@) = e 2 
the 'reduced' partition function Zn((3; L) = ^ Jq \pn{0)\ 2 ^ d0 we have 



x Pn(G), such that for 
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r<s 



l-^Udyj (74) 



The integral in the above equation is the standard Selberg integral (Forrester & Warnaar 2008), 
which gives finally 

J=k 



E 



Z, 



k^T 2 [l-(j-l)f3 2 ]T(l-jf3 2 ) 



n 

3=1 



r[2-(fe + j 



where we have introduced the scale Z e 



2)/32] ' 

G 2 (l+/3) 
G(l+2/3)r(l-/3 2 ) 



1< k < 0- 



, with N L = N 



2tt' 



(75) 



For 



the case k > f3~ 2 the integral is divergent and we can apply a consideration similar to that 
applied for the full-circle case, which is of secondary importance for our goals and will not be 
repeated here. 
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We see that the expressions (75) for the moments of z(f3) = Z^jZ e are precisely the 
moments of the distribution analyzed in the paper Fyodorov, Le Doussal, Rosso 2009. That 
paper dealt with the aperiodic version of the l//-noise sampled from the two-dimensional 
full-plane GFF along an interval of unit length. The analysis followed essentially the same 
steps as for the circular-logarithmic model, but technically was much more involved due to the 
complicated structure of the moments. The main difficulty was to find a method of continuing 
to complex k in a meaningful way, which allows one to find the probability density 
of z from the above moments. The goal was successfully achieved in Fyodorov, Le Doussal, 
Rosso 2009 by employing heuristic methods. Independently, a very similar problem arose in 
the context of financial mathematics, and an elegant mathematically rigorous solution was 
proposed in Ostrovsky 2009, see also Ostrovsky 2012, with the results of the two approaches 
coinciding. 

The extreme/high value statistics for the modulus of the characteristic polynomial in the 
interval N^ 1 <C ^ <C 1 can again be translated straightforwardly from those results, and was 
summarized in the beginning of Section 2. 



4-3. Multifractal properties of the modulus of characteristic polynomials 



The results for mesoscopic intervals presented above can be also interpreted as revealing the 
multifractal structure of the function \pn(8)\. To see this more clearly we resort to a standard 
tool of multifractal analysis, box counting, and subdivide the full interval [0, 2ir] into M = 2-n jib 
subintervals ("boxes") of equal length If, chosen in such a way that the number of boxes satisfies 
1 -C M = 2ir/lb <C N. We further define the set of coarse-grained values h n (lb), n = 1, . . . , M 
of the function \pn(9)\ by averaging it over the n— th box and the corresponding "partition 
functions" Cq(h) via 



h n (h) 



nit, 



\ PN (e)\d8, Uh) 



i 



M 



l b J(n-l)l b — i=1 

Now, straightforward calculation gives for integer q the ensemble-averaged value: 



(76) 



E{( q (l b } = M q / E{\p N (ex)\ . . . \ PN (6 q )\} 



d9i 
2^ 



NV 4 G 2 (3/2) 



S q (f3 2 = l/4) I, 



2tt 



Mi- 1 ) 



M\ q 

-J E{2«03 = 1/2;J 6 )} (77) 



1 < q < 4 



(78) 



where we have used the rotational invariance of the ensemble-averaged values, the definition 
(67) and the formula (73), with S q {(3 2 ) standing for the /3-dependent Selberg integral featuring 
in (73) and convergent for 1 < q < (3~ 2 . Though formally derived for the integer q one expects 



the above to be valid for real q < 4 as well, so we finally arrive to the relation 



i 



i 



S q (l/4) l b T \ Tq =-q(q-l), -«Z b «2vr, q < 4 



(79) 



In view of the nonlinear dependence of the exponent r q on q, this scaling of the ratio i q {lb) 
associated with coarse-graining on the scale lb implies multifractality of the underlying function 
\pn{9)\, see e.g. Fyodorov, Le Doussal, Rosso 2012 with discussion and further references 
therein. In fact, multifractality could be immediately anticipated from the presence of the 
lognormal factor e2 Uv/_21nib in the "mesoscopic" statistics of \p{6)\ with an exponent whose 
variance, — \nlb ~ InM, is linear in the log of the number of boxes. 
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4-4- Measure of high points of characteristic polynomials 

The multifractal structure of 1/ f noise also shows up in a somewhat different, but related way 
in the statistics of high values of |pjv(#)|- Note that as follows from our discussion the typical 
value of the maximum of |pat(#)| in the interval 9 £ [0, L] is Nl. The simplest quantity which 
helps to quantify the structure of high values is the relative length /j,n(x;L) (as fraction of 
the total length L) of those intervals in [0, L] where \pn{9)\ > {Nl) x for a fixed < x < 1. 
Statistics of this quantity are naturally related to the statistics of the partition function in 
the high-temperature phase /3 < 1 , as was informally discussed for the case of the log-circular 
model in Section 3. To substantiate the claim for the characteristic polynomials we choose here 
to follow an alternative procedure. Though some parts of the method remain of somewhat 
heuristic nature, we believe it grasps the mathematical structures correctly. It remains a 
challenge to justify it by rigorous methods. 
We start from the definition 

m (x- L ) = l£ X{2 log \ PN (6)\ - 2x log N L }d9, (80) 

where x{ u } = 1 if it > and zero otherwise. Using the Fourier-transform one can rewrite this 
as 

21ogiV L r L d9 
2tt ./n T 



fj, N (x;L) 



dn 



dk e 



-2ikr] log Nl 



\pn(0)\ 



2ik 



(81) 



Our goal is again to calculate the integer moments (//jv(x; L) p ) for p = 1,2, . . .. These can 
be found by the same procedure we used for the "partition function" moments above and are 
given by 

'logiV L V f L d9, 



Mx-LY)={^y jf ^...jf d m ...d Vp l( m ,..., Vp ),(82) 



where 



/oo 
dh... dV~ 2il ° g7VL E?=1 k3 ' r ' 3 (\PN(9i)\ 2lkl • • • \pn{9 p ) 
-oo * 



The ensemble average is again a Toeplitz determinant: 



\p N (9i)\ 2Xl ...\PN(0 k 



|2A, 



DP(P) = det 



JV-l 
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where 



M- 



09) 
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(83) 



(84) 



(85) 



As already noted, the asymptotic behaviour of such a Toeplitz determinant in the limit N 
is given by the Fisher-Hartwig conjecture proved in Widom 1975: 



oo 
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2 Af Ag 



where G(x) is the Barnes function, and the formula is valid for ReXj > 



(86) 

-1/2. For purely 



imaginary \j 
l(rji,...,v P ) - 



ikj, which is our case, it gives 



dh.-.d^e-^^U^+^Yl 
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(87) 
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In the limit log Nl 3> 1 the integral is dominated by the saddle point kj 
which gives 

\p/2 v r , m P 



-irij, Vj = 1, 



•rip) 
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logiVL, 
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(88) 



Substituting this back into (82) one may expect that all n— integrals for log Nl 3> 1 and fixed 
x > will be dominated by the lower limit rjj = x, Vj. To this end we introduce new variables 



by n j = x + 



(fi N (x;Ly 



^2 \fj = 1 

21ogAT L ' V J ± ' 

logiVL\ P/2 



J (2x log JV l )p 



. , p and find for x S> 1/ \/\ogN£ 
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(89) 



The remaining integral can be performed explicitly again in the two limits: 
(i) the full-circle case L = 2ir so that Nl = N and (ii) the mesoscopic interval 1 <C Nl <S -/V. 
In case (i) it is the familiar Dyson-Morris integral which gives T(l — px 2 )/[T(l — x 2 )] p for 
< x 2 < 1/p and diverges otherwise (the same remark on the divergence of the moments of 
the partition function applies here). Collecting all factors, we get 



(vn(x) p ) |io gJ v»i « Mx)f r(i - P x 2 ), o<x 2 < i/p 



where the 'typical' value \i e 



x 



N~ 



for the measure is given by 
G 2 (l + x) 



1 



1 



vrlogiV2xG(l + 2x)r(l 



< x < 1 



(90) 



(91) 



reflects the multifractal-type structure of the measure 



The nontrivial leading scaling with N~ 
of intervals supporting high values. 

Note that the mean value of the measure is given by E{//at(x)} = /x e (x)r(l — x 2 ). It 
obviously stays finite for x — > 1, and a direct calculation shows such an expression is valid for 
any x > 0, without restricting to x < 1. However, when approaching x = 1 the mean value is 
significantly larger than the typical value, and is dominated by rare fluctuations. Recall that 
x + = 1 — clnln N with c = | is conjectured to be the thereshold of extreme values. This claim 
is consistent with the fact that at such a level x the measure supporting high values is of the 
order of fj, e (x) ~ A r_1 , that is comparable with the minimal scale of the problem (the typical 
separation between zeroes) which simply means there are typically of order of one maxima 
above such a level. 

In the whole interval < x < 1 the probability density of fi = /xat(x) can be immediately 
recovered by noticing that the moments of £ = fi/fi e (x) coincide with those of the random 
variable Z/Z e distributed according to (52), with the obvious identification x 2 — > f3 2 . We 
conclude that the total relative length /^/v>iOe) of the intervals supporting high values of 
characteristic polynomial is distributed according to the probability density V(fi) given by 
Such an expression should be valid for all u as long as \i <C 1 and when \x 



(27). 



1 must 



have a sharp cut off, as obviously the fraction of the total length cannot be larger than unity 
in any realization. Note, however, that although the above picture is in good qualitative 
agreement with numerical experiments performed for the circular-logarithmic model, it has so 
far not proved possible to achieve quantitative agreement with ( |27| ) for any realistic numerical 
simulation of 1/ / noise (see Fyodorov, Le Doussal & Rosso 2012). 

For the mesoscopic interval with L < 1 we can proceed similarly, and, again after 
expanding in (89) the integrand assuming 9j <C 1, connect to the corresponding moments 
of the partition function (74), (75) with the obvious change f3 2 — > x 2 . In this way we find 
that the random variable hn{x) is distributed as the product of two independent factors: 
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Hn{ x ) = e xuy/ 21ni /2Ar(x), with the standard normal u and /ijv(^) characterized by the integer 
moments 

Efe(g) p }|iog^>i«Ae(x) p n r[2-(fc + j-2)s2] ' 0<x2<1 /P (92) 



Here the 'typical' value fi e ( x ) is related to the similar scale for the full-circle case (91): 



p, e (x) = . g g He(x), so shares the same multifractal scaling of the length of the intervals 

supporting high values. To restore the probability density V{£) of the random variable 
£ = /2jv(x)//} e (x) we define the generic moments M x {s) = E{£ 1_,s } for any complex s at 
fixed < x < 1. An explicit expression for M x (s) was found in Fyodorov, Le Doussal, & Rosso 
2009 and Ostrovsky 2009, and is given by (|29|). This allows us to represent V(£) as a contour 



integral (28). The dominating features of that distribution, like the powerlaw tail at £ S> 1, 
are the same as for the full-circle case. Numerical verifications of these features are expected 
to be even more challenging. 

5. Motivation for the predictions relating to the extreme value statistics of 

C(l/2 + it) 

The calculations outlined in the previous section are based on estimating the asymptotics of 
the moments of the characteristic polynomials, and using the high moments to determine the 
extreme values. The connections between these moments and those of the zeta function are 
now relatively well understood, at least conjecturally (Keating & Snaith 2000, Conrey et al. 
2004, Conrey et al. 2005, Gonek, Hughes & Keating 2007). For example, to leading order as 
oo 

rT 

\ 2X dt~ a(\)E\p N (9)\ 2X , (93) 



ijf |C(i/2 + if) 



with N = log where a(A) is defined by (36). This allows us to use the previous calculations 



to motivate the predictions for the extreme value statistics for the zeta function. The approach 
follows closely that already detailed in the random-matrix context, and so we limit ourselves 
to outlining the principal steps. 

The analogue of the partition function Zn(P', L) defined by Q is clearly 

h log k f t +L m,2+iy) \ Wdy (94) 



and so the analogue of the moments of the partition function (67) is given by 



i r T / 1 t \ k r t+L r t+L r ^ k 

/ 2^ l0g 2^ / •••/ {lC(l/2 + %)| 2/3 ...|C(l/2 + ^)| 2/3 }n^ (95) 
JTq \ / J t J t j . i 

Interchanging the i-integral with the yj-integrals (using the fact that the premultiplying log t 
factor is slowly varying) produces an integrand of the form 

f-T 



T-T ( 



T 



V / ICU/2 + it + iyi)\ 2p . . . |C(l/2 + it + iy k )\ 2l3 dt. (96) 



This is the analogue of the left-hand side of ( 68 ) . A general expression for shifted moments of 



this kind was conjectured in Conrey et al. 2003, Conrey et al. 2005. This takes the form of a 



Freezing Transitions and Extreme Values 



32 



multiple integral; for example 



G ? c( z i'---' z 2fc)A 2 (zi,...,z 2 fc) 
x e | log £ E -=i *,-*k+i rfzi . . . dz 2k dt + o(l), 



where 



fe fc 

G ? Oi, . . . , z 2fc ) = Ajfc(zi, . . . , z 2 k) Yl II C(l + z i ~ Zk+j), (98) 

»=i j=i 

and ^4^ is another Euler product which is analytic in the regions in which we are interested. 
The leading order asymptotics of such integrals was shown in Conrey et al. 2003, Conrey et 
al. 2005, Kosters 2010, Chandee 2011 to take a form analogous to (70), but with an additional 
factor a(P) k , where a(/3) is defined by (36). The calculation then proceeds exactly as in the 
previous section; for example, the arithmetic factor multiples Z e . In determining the extreme 
value statistics, however, the fact that the freezing transition occurs at = 1 and that a(l) = 1, 
suggests that the arithmetical factor does not contribute at leading order. In the computation 
of the measure of large values this factor remains. 

It should be emphasized that this calculation only concerns the leading order asymptotics. 
It is known that the lower order terms contribute significantly to the moments and are 
necessary to model numerical computations. This is because the moments are long-range 
statistics and so are more sensitive to lower-order corrections than local statistics. It would 
presumably be important to develop a method to calculate these lower order terms in order 
to develop a more accurate model for p(x) at a finite height up the critical line. 
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Appendix A: Small-distance behaviour of the two-point correlation function of 

RelogC(| + rt). 

We have from the Euler product that 

io gC ( S ) = -£i°g(i--M=EE-^ (99) 

where the p— summation includes all prime numbers. According to the Prime Number 
Theorem, the number ir(x) of primes smaller than x grows as ir{x) ~ when x — > oo. 
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This can be interpreted as implying that the probability that a large integer n is prime is 
asymptotically 1/logn, see Heath-Brown 2005 for an introduction. Moreover, sums over primes 
of the type Yl p f(p) that are dominated by large primes can be approximated by J°° dx. 
Substituting (|99|) into the two-point correlation function we obtain 



V t i0 (x 1 )V t {0 (x 2 ) 



oo 

V V — - 

^ ^ nin 2 r) ni / 2 T) n2 /2 

Pl,P3TH,2=l Pi 1 J 2 



(cos [ni(t + xi) logpi] cos [n 2 (t + x 2 ) logp2]) (100) 



Further rewriting the product of cosine factors as 



1 



cos 



p nl 

tlog^j- + n\ xi logpi 

P 2 



n 2 x 2 logp 2 



+ ^cos [t log p^p^ 2 



n\ x\ logpi + n 2 x 2 \ogp 2 ] 



we see that second term will always give rise to rapid oscillations with t, whereas the first 
term does the same except when p™ 1 = p% 2 ■ We conclude that the dominating contributions 
come from those non-oscillating terms in the sum: pi = p 2 = p; n% = n 2 = n, resulting in the 
following "diagonal approximation" for the two-point correlation function: 



1 °° 1 



2 < — ' < — ' n"p" 

p n=l 



(101) 



Using 4 



+ an d denoting x = \x\ — x 2 \ the relation(|101l) can be rewritten as 



1 1 11 

V t i0 (xi)V t {0 (x 2 )} - ^^2^—^cos[nx logp] + - ^ Yl "^T cos t n x logp ] 

*> 1 P p n=2 P 



p n=l 



(102) 



Now notice that for any n > 2 the sums ^2 p ^ are convergent, as the integrals 
/°° .r" w .r ^ X are convergent. This fact implies that the second sum in (102) has a finite 



x n log X' 

value when x 



0. As to the first sum, invoking (99) one observes that it can be related 
exactly to the Riemann zeta-function ((s) along the line s = 1 + ix, that is ^Relog£ (1 + ix). 
Recall finally that £ (1 + ix) | x -s>o ~ jz, implying in the limit x — >• the relation 



(ttf (siW (0 (a*)) 



-RelogC(H-*|a?i-x 2 |)+0(l) 



•- log |xi - x 2 |(103) 



Of course, the logarithmic form of correlations cannot hold for arbitrary small x = \xi — x 2 \; 
(0 1 2 \ 

V t (x) ) must be obviously finite. As is well known (see, e.g., Keating 1993) the 

diagonal approximation works only for p < and breaks down for larger primes. Taking 
this into account one can show that (103) holds as long as \x\ — x 2 \ 3> whereas 

2\ 

y i log log t for i — )■ oo. 

The fact that F t (C) (x) has a gaussian distribution follows from a similar analysis of the 
higher moments. Essentially, the diagonal terms dominate and so the prime sums representing 
the higher moments reduce to powers of the second moment in the same way as those of a 
gaussian. 



V t {0 (x) 
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